Smoothing Noisy Spectroscopic Data with Many-Knot Sphne 

Method 
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In this paper, we present the development of a many-knot sphne method derived to 
remove the statistical noise in the spectroscopic data. This method is an expansion 
of the B-spline method. Compared to the B-spline method, the many-knot spline 
method is significantly faster. 

Smoothing using least square method with B-spline functions is helpful in reducing the 
statistical noise in the spectroscopic data, such as gamma-ray spectrum. However, when the 
amount of points is very large, this method becomes time-consuming because of the need to 
solve nonlinear least squares equations, especially if the initial knots are not well determined 
or too many channels are selected as initial knots. 





(a) (b) 

FIG. 1: (a) The construction of quadric many-knot spline basis function shape; (b) 5-basis many- 
knot splines with each count value = 1. 
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To avoid this problem, a many-knot spline is used in substitution for B-spline for smooth- 
ing noisy spectroscopic data. The many-knot method is an expansion technique of B-spline, 
described by the third author D.X.Qijl, Q]. Its basis function of order k is denoted as 
following: 

qu{x) = Y.mr^> (1) 
where U are coefficients that can be solved easily according to its characteristic and f^^''* is 

ao,ai,...,afc_i are transform coefficients and ^k{x) is B-spline basis function of order k. Using 
fc = 2 as an example, the quadric many-knot spline basis function can be expressed as: 

q2{x) = 2fi2(x) - \P2{x + \) + n^ix - i)] (3) 

with its structure shown as Fig. 1(a). Many-knot spline basis functions have the similar 
characteristics as B-spline basis functions. Importantly, unlike the B-spline method, the 
many-knot spline method can obtain the best fitting curve without solving the equation, 
which can save a lot of time in the calculation. The details can be obtained from Ref . IjJ, 

In the new method, five channels with equal interval in the interesting region are selected 
as initial knots. The average count values of the adjacent channel are calculated for each 
knot. A fitting curve can be constructed by summing the basis spline functions corresponding 
to the initial knots with the average counts values as shown in Fig. 1(b). Next, new channels 
are selected in the middle of each interval as knots and the fitting is repeated until the 
interval is equal to 1. With the increase of initial knots, all the fitting curves form a finite 
set and the best fitting curve can be obtained from this set according to the criterion of 
Reinschj^]. The noisy spectroscopic data as shown in Fig.2(a) is synthesized by gaussian 
functions with superimposed random noise varied as a function of the square root of the 
counts per channel. The fitting curve derived from the quadric many-knot spline method 
is shown as Fig.2(b) corresponding with the result using cubic B-spline shown as Fig.2(c). 
As can be seen from this figure, the curve obtained by the many-knot spline method has 
a similar shape as the curve obtained by B-spline method, its validity will be described 
elsewhere. However, the time consumed by both methods are entirely different as shown in 
Fig.2(d). The time consumed by the many- knot spline method is linear and changes little 
with the increasing of initial knots, compared to exponentially increasing time spent using 
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the B-spline method in the same environment. This difference is more obvious when the 
amount of spectroscopic data is large or when there are too many initial knots. 




1 00 200 300 

Channels 



(a) 




200 300 
Channels 




10 



10 



10 



10" 



200 300 
Channels 



500 



(b) 



■ many-knot method 

■ B-spline method 



9 17 33 

Initial Knots 



65 



129 



(c) (d) 

FIG. 2: (a) Noisy spectroscopic data; (b) Smoothing noisy spectroscopic data with quadric many- 
knot spline method. (c) Smoothing noisy spectroscopic data with cubic B-spline method. (d) The 
time consumed by both methods with different number of channels selected as initial knots. 
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